Analyzing the Privacy Preserving Using Big Data Techniques
نویسندگان
چکیده
Recently big data has become a hot research topic. The rising amounts of big data also increase the chance of violate the privacy of individuals. Since big data need high computational power and large storage, distributed systems are used. As multiple parties are concerned in these systems, the risk of privacy violation is improved. There have been a number of privacy-preserving methods developed for privacy protection at different stages (e.g., data generation, data storage, and data processing) of a big data life cycle. The goal of this paper is to provide a comprehensive overview of the privacy preservation methods in big data and present the challenge for existing mechanisms. In this paper, we illustrate the infrastructure of big data and the big data life cycle. Furthermore, we discuss the research challenges of the privacy preservation in big data. This paper also presents recent techniques of privacy preserving in big data like hiding a needle in a haystack, identity based anonymization, differential privacy and privacy-preserving big data publishing of big data streams and future research directions related to big data.
منابع مشابه
Privacy and Security of Big Data in THE Cloud
Big data has been arising a growing interest in both scien- tific and industrial fields for its potential value. However, before employing big data technology into massive appli- cations, a basic but also principle topic should be investigated: security and privacy. One of the biggest concerns of big data is privacy. However, the study on big data privacy is still at a very early stage. Many or...
متن کاملA Survey on Big Data & Privacy Preserving Publishing Techniques
Big data describes very large data sets that have more diverse and complicated structure like weblogs, social media, email, sensors, and photographs. These less structured data and distinctiveness characteristics from traditional databases typically associated with extra complications in storing, analyzing and applying further procedures or extracting results. Big data analytics is the process ...
متن کاملPrivacy and Security of Big Data in THE Cloud
Big data has been arising a growing interest in both scien- tific and industrial fields for its potential value. However, before employing big data technology into massive appli- cations, a basic but also principle topic should be investigated: security and privacy. One of the biggest concerns of big data is privacy. However, the study on big data privacy is still at a very early stage. Many or...
متن کاملParallelizing K-Anonymity Algorithm for Privacy Preserving Knowledge Discovery from Big Data
Disclosure control has become inevitable as privacy is given paramount importance while publishing data for mining. The data mining community enjoyed revival after Samarti and Sweeney proposed k-anonymization for privacy preserving data mining. The k-anonymity has gained high popularity in research circles. Though it has some drawbacks and other PPDM algorithms such as l-diversity, t-closeness ...
متن کاملOpportunities and Challenges for Privacy-Preserving Visualization of Electronic Health Record Data
In this paper, we reflect on the use of visualization techniques for analyzing electronic health record data with privacy concerns. Privacy-preserving data visualization is a relatively new area of research compared to the more established research areas of privacy-preserving data publishing and data mining. We describe the opportunities and challenges for privacy-preserving visualization of el...
متن کامل